Machine Learning Potential Model Based on Ensemble Bispectrum Feature Selection and Its Applicability Analysis
نویسندگان
چکیده
With the continuous improvement of machine learning methods, building interatomic potential (MLP) based on datasets from quantum mechanics calculations has become an effective technical approach to improving accuracy classical molecular dynamics simulation. The Spectral Neighbor Analysis Potential (SNAP) is one most commonly used potentials. It uses bispectrum encode local environment each atom in lattice. hyperparameter jmax controls mapping complexity and precision between descriptor. As increases, description will more accurate, but number parameters descriptor increase dramatically, increasing computational complexity. In order reduce without losing accuracy, this paper proposes a two-level ensemble feature selection method (EFS) for descriptor, combining perturbation selector strategy. Based proposed method, subset selected original dataset dimension-reduced MLP. application validation, data Fe, Ni, Cu, Li, Mo, Si, Ge metal elements are train linear regression model SNAP predicting these metals’ atomic energies forces them evaluate performance subsets. experimental results show that, compared features qSNAP, training our EFS qSNAP than SNAP. Compared with existing when size 0.7 times that features, SSWRP strategy can achieve best terms stability, achieving average stability 0.94 across all datasets. reduced by about half, prediction 30%.
منابع مشابه
Constructing response model using ensemble based on feature subset selection
In building a response model, determining the inputs to the model has been an important issue because of the complexities of the marketing problem and limitations of mental models for decision-making. It is common that the customers’ historical purchase data contains many irrelevant or redundant features thus result in bad model performance. Furthermore, single complex models based on feature s...
متن کاملCorrelation-based Feature Selection for Machine Learning
A central problem in machine learning is identifying a representative set of features from which to construct a classification model for a particular task. This thesis addresses the problem of feature selection for machine learning through a correlation based approach. The central hypothesis is that good feature sets contain features that are highly correlated with the class, yet uncorrelated w...
متن کاملClassification of lung cancer using ensemble-based feature selection and machine learning methods.
Lung cancer is one of the leading causes of death worldwide. There are three major types of lung cancers, non-small cell lung cancer (NSCLC), small cell lung cancer (SCLC) and carcinoid. NSCLC is further classified into lung adenocarcinoma (LADC), squamous cell lung cancer (SQCLC) as well as large cell lung cancer. Many previous studies demonstrated that DNA methylation has emerged as potential...
متن کاملEnsemble-based Feature Selection Criteria
Recursive Feature Elimination (RFE) combined with feature ranking is an effective technique for eliminating irrelevant features when the feature dimension is large, but it is difficult to distinguish between relevant and redundant features. The usual method of determining when to stop eliminating features is based on either a validation set or cross-validation techniques. In this paper, we pres...
متن کاملAn Intelligence-Based Model for Supplier Selection Integrating Data Envelopment Analysis and Support Vector Machine
The importance of supplier selection is nowadays highlighted more than ever as companies have realized that efficient supplier selection can significantly improve the performance of their supply chain. In this paper, an integrated model that applies Data Envelopment Analysis (DEA) and Support Vector Machine (SVM) is developed to select efficient suppliers based on their predicted efficiency sco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Metals
سال: 2023
ISSN: ['2075-4701']
DOI: https://doi.org/10.3390/met13010169